AITopics | generic representation

Collaborating Authors

generic representation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

PowerPM: Foundation Model for Power Systems

Neural Information Processing SystemsDec-27-2025, 08:28:13 GMT

The proliferation of abundant electricity time series (ETS) data presents numerous opportunities for various applications within power systems, including demand-side management, grid stability, and consumer behavior analysis. Deep learning models have advanced ETS modeling by effectively capturing sequence dependence. However, learning a generic representation of ETS data for various applications is challenging due to the inherently complex hierarchical structure of ETS data. Moreover, ETS data exhibits intricate temporal dependencies and is susceptible to the influence of exogenous variables.

artificial intelligence, machine learning, proceedings, (11 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.59)

Add feedback

PowerPM: Foundation Model for Power Systems

Neural Information Processing SystemsMay-27-2025, 17:34:37 GMT

artificial intelligence, machine learning, powerpm, (9 more...)

Neural Information Processing Systems

Industry: Machinery > Industrial Machinery (0.87)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.61)

Add feedback

Fine-tuned network relies on generic representation to solve unseen cognitive task

Lin, Dongyan

arXiv.org Artificial IntelligenceJun-27-2024

We aim to understand the extent to more on generic pretrained representation, or develop which fine-tuned models depend on their pretrained representations brand new task-specific solutions? Here, to solve a novel task. To this end, we compare the we fine-tuned GPT-2 on a context-dependent representations after fine-tuning with those developed by decision-making task, novel to the model but GPT-2 optimized solely on this task from scratch. We chose adapted from neuroscience literature. We compared this task not only because it is novel but also because its its performance and internal mechanisms grounding in neuroscience allows us to explore the data with to a version of GPT-2 trained from scratch on computational neuroscience methods and make direct comparisons the same task. Our results show that fine-tuned between representations in biological and artificial models depend heavily on pretrained representations, neural networks.

accuracy, generic representation, representation, (16 more...)

arXiv.org Artificial Intelligence

2406.18926

Country:

North America > Canada > Quebec > Montreal (0.14)
Europe > Austria > Vienna (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Towards a Generic Representation of Combinatorial Problems for Learning-Based Approaches

Boisvert, Léo, Verhaeghe, Hélène, Cappart, Quentin

arXiv.org Artificial IntelligenceMar-12-2024

In recent years, there has been a growing interest in using learning-based approaches for solving combinatorial problems, either in an end-to-end manner or in conjunction with traditional optimization algorithms. In both scenarios, the challenge lies in encoding the targeted combinatorial problems into a structure compatible with the learning algorithm. Many existing works have proposed problem-specific representations, often in the form of a graph, to leverage the advantages of \textit{graph neural networks}. However, these approaches lack generality, as the representation cannot be easily transferred from one combinatorial problem to another one. While some attempts have been made to bridge this gap, they still offer a partial generality only. In response to this challenge, this paper advocates for progress toward a fully generic representation of combinatorial problems for learning-based approaches. The approach we propose involves constructing a graph by breaking down any constraint of a combinatorial problem into an abstract syntax tree and expressing relationships (e.g., a variable involved in a constraint) through the edges. Furthermore, we introduce a graph neural network architecture capable of efficiently learning from this representation. The tool provided operates on combinatorial problems expressed in the XCSP3 format, handling all the constraints available in the 2023 mini-track competition. Experimental results on four combinatorial problems demonstrate that our architecture achieves performance comparable to dedicated architectures while maintaining generality. Our code and trained models are publicly available at \url{https://github.com/corail-research/learning-generic-csp}.

architecture, combinatorial problem, constraint, (14 more...)

arXiv.org Artificial Intelligence

2403.06026

Country:

Europe > Austria > Vienna (0.14)
North America > Canada > Quebec > Montreal (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
(2 more...)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

Add feedback

Balanced Supervised Contrastive Learning for Few-Shot Class-Incremental Learning

Yoon, In-Ug, Choi, Tae-Min, Kim, Young-Min, Kim, Jong-Hwan

arXiv.org Artificial IntelligenceMay-26-2023

Few-shot class-incremental learning (FSCIL) presents the primary challenge of balancing underfitting to a new session's task and forgetting the tasks from previous sessions. To address this challenge, we develop a simple yet powerful learning scheme that integrates effective methods for each core component of the FSCIL network, including the feature extractor, base session classifiers, and incremental session classifiers. In feature extractor training, our goal is to obtain balanced generic representations that benefit both current viewable and unseen or past classes. To achieve this, we propose a balanced supervised contrastive loss that effectively balances these two objectives. In terms of classifiers, we analyze and emphasize the importance of unifying initialization methods for both the base and incremental session classifiers. Our method demonstrates outstanding ability for new task learning and preventing forgetting on CUB200, CIFAR100, and miniImagenet datasets, with significant improvements over previous state-of-the-art methods across diverse metrics. We conduct experiments to analyze the significance and rationale behind our approach and visualize the effectiveness of our representations on new tasks. Furthermore, we conduct diverse ablation studies to analyze the effects of each module.

artificial intelligence, machine learning, representation, (18 more...)

arXiv.org Artificial Intelligence

2305.16687

Country: Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)

Genre: Research Report > New Finding (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

One Representation to Rule Them All: Identifying Out-of-Support Examples in Few-shot Learning with Generic Representations

Kvinge, Henry, Howland, Scott, Courts, Nico, Phillips, Lauren A., Buckheit, John, New, Zachary, Skomski, Elliott, Lee, Jung H., Tiwari, Sandeep, Hibler, Jessica, Corley, Courtney D., Hodas, Nathan O.

arXiv.org Artificial IntelligenceJun-2-2021

The field of few-shot learning has made remarkable strides in developing powerful models that can operate in the small data regime. Nearly all of these methods assume every unlabeled instance encountered will belong to a handful of known classes for which one has examples. This can be problematic for real-world use cases where one routinely finds 'none-of-the-above' examples. In this paper we describe this challenge of identifying what we term 'out-of-support' (OOS) examples. We describe how this problem is subtly different from out-of-distribution detection and describe a new method of identifying OOS examples within the Prototypical Networks framework using a fixed point which we call the generic representation. We show that our method outperforms other existing approaches in the literature as well as other approaches that we propose in this paper. Finally, we investigate how the use of such a generic point affects the geometry of a model's feature space.

artificial intelligence, encoder, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2106.01423

Country:

North America > Canada (0.46)
North America > United States (0.46)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

Scenario-Transferable Semantic Graph Reasoning for Interaction-Aware Probabilistic Prediction

Hu, Yeping, Zhan, Wei, Tomizuka, Masayoshi

arXiv.org Artificial IntelligenceApr-6-2020

Accurately predicting the possible behaviors of traffic participants is an essential capability for autonomous vehicles. Since autonomous vehicles need to navigate in dynamically changing environments, they are expected to make accurate predictions regardless of where they are and what driving circumstances they encountered. A number of methodologies have been proposed to solve prediction problems under different traffic situations. However, these works either focus on one particular driving scenario (e.g. highway, intersection, or roundabout) or do not take sufficient environment information (e.g. road topology, traffic rules, and surrounding agents) into account. In fact, the limitation to certain scenario is mainly due to the lackness of generic representations of the environment. The insufficiency of environment information further limits the flexibility and transferability of the predictor. In this paper, we propose a scenario-transferable and interaction-aware probabilistic prediction algorithm based on semantic graph reasoning, which predicts behaviors of selected agents. We put forward generic representations for various environment information and utilize them as building blocks to construct their spatio-temporal structural relations. We then take the advantage of these structured representations to develop a flexible and transferable prediction algorithm, where the predictor can be directly used under unforeseen driving circumstances that are completely different from training scenarios. The proposed algorithm is thoroughly examined under several complicated real-world driving scenarios to demonstrate its flexibility and transferability with the generic representation for autonomous driving systems.

data mining, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2004.03053

Country:

North America > United States > California > Alameda County > Berkeley (0.14)
North America > United States > Illinois (0.04)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Transportation > Infrastructure & Services (1.00)
Transportation > Ground > Road (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
(3 more...)

Add feedback

How to learn the maths of Data Science using your high school maths knowledge

#artificialintelligenceOct-15-2019, 21:18:15 GMT

This post is a part of my forthcoming book on Mathematical foundations of Data Science. In this post, we use the Perceptron algorithm to bridge the gap between high school maths and deep learning. As part of my role as course director of the Artificial Intelligence: Cloud and Edge Computing at the University..., I see more students who are familiar with programming than with mathematics. They have last learnt maths years ago at University. And then, suddenly they find that they encounter matrices, linear algebra etc when they start learning Data Science.

artificial intelligence, machine learning, perceptron, (13 more...)

#artificialintelligence

Industry: Education > Educational Setting > K-12 Education > Secondary School (0.62)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (1.00)

Add feedback

Tutorial: Declarative Machine Learning

#artificialintelligenceMar-27-2016, 16:00:49 GMT

Machine learning explores the study and construction of algorithms that learn and make predictions based on data. In the field of machine learning, data scientists, who specialize in analyzing data, are responsible for writing and modifying such algorithms. Initially, a data scientist writes an algorithm based on a set of data features. This is generally an iterative process in which the data scientist explores different algorithms for predictive purpose. In this process, the amount of data and the number of features chosen for analysis may change.

algorithm, artificial intelligence, machine learning, (15 more...)

#artificialintelligence

Genre: Research Report > New Finding (0.37)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.99)

Add feedback

Toward a generic representation of random variables for machine learning

Marti, Gautier, Very, Philippe, Donnat, Philippe

arXiv.org Machine LearningSep-3-2015

This paper presents a pre-processing and a distance which improve the performance of machine learning algorithms working on independent and identically distributed stochastic processes. We introduce a novel non-parametric approach to represent random variables which splits apart dependency and distribution without losing any information. We also propound an associated metric leveraging this representation and its statistical estimate. Besides experiments on synthetic datasets, the benefits of our contribution is illustrated through the example of clustering financial time series, for instance prices from the credit default swaps market. Results are available on the website www.datagrapple.com and an IPython Notebook tutorial is available at www.datagrapple.com/Tech for reproducible research.

artificial intelligence, machine learning, random variable, (18 more...)

arXiv.org Machine Learning

1506.00976

Country:

North America > United States (0.14)
Europe > France > Île-de-France > Paris > Paris (0.04)
Europe > United Kingdom > England > Greater London > London > City of London (0.04)
Asia > Japan (0.04)

Genre: Research Report (0.64)

Industry: Banking & Finance > Trading (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.94)

Add feedback